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Abstract 

Simulating a cellular automaton (CA) for t time-steps into the future 
requires t 2 serial computation steps or t parallel ones. However, certain 
CAs based on an Abelian group, such as addition mod 2, are termed linear 
because they obey a principle of superposition. This allows them to be 
predicted efficiently, in serial time 0(t) or 0(\ogt) in parallel. 

In this paper, we generalize this by looking at CAs with a variety of 
algebraic structures, including quasigroups, non-Abelian groups, Steiner 
systems, and others. We show that in many cases, an efficient algorithm 
exists even though these CAs are not linear in the previous sense; we term 
them quasilinear. We find examples which can be predicted in serial time 
proportional to t, tlogt, tlog 2 t and t a for a < 2, and parallel time logt, 
log t log log t and log 2 t. 

We also discuss what algebraic properties are required or implied by 
the existence of scaling relations and principles of superposition, and ex- 
hibit several novel "vector-valued" CAs. 



1 Introduction: CAs as algebras 

A Cellular Automaton is a dynamical system on sequences . . . a_iao&i . . . where 
the dj are symbols in some finite alphabet A. It is updated by a local function 
/, usually written 

&i f^P'i— ri • - • i ■ • • ) ^i+r) 

where r is the radius of the rule. 

In this paper, we will consider r = 1/2 rules, with a staggered space-time as 
shown in figure 1; a CA of any radius can be transformed into one with r = 1/2 
by grouping sets of 2r sites together. Each new state is a function of just two 
predecessors, 

a 'i = Z( a i-l/2> a i+l/2) 

or 

a = b • c 
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Figure 1: The staggered space-time of an r = 1/2 CA. 



where • is a binary operation on A. This approach was also taken in 




Like any dynamical system, we would like to know how hard a particular 
CA is to predict. In particular, suppose we are given a finite stretch of initial 
conditions, ao . . .at. After t time-steps, there is a single site at the bottom of 
a light-cone with this initial row on top. Call the state of the CA at that site 
s = Pt(ao, oi, . . . , at), where P t is the prediction function. 

In general, P t is quite messy; the light-cone looks like 



a ai a 2 a 3 

aoai a\a2 0203 



s 



(aoai)(aia 2 ) (aia 2 )(a 2 a 3 ) ^ 
= ((aoai)(aia2))((aia 2 )(a2a 3 )) 



where we have abbreviated a • b as simply ab. Clearly, P t can always be calcu- 
lated in t(t + l)/2 = 0(t 2 ) serial steps or t parallel ones, simply by simulating 
the CA and calculating all the products in the light-cone above s. But if • 
fulfills certain algebraic properties, we may be able to calculate Pt much more 
efficiently. Discussing how different algebraic structures on • affects the com- 
plexity of Pt is the aim of this paper. 

There is an enormous amount of variety between easily predictable CAs, such 
as addition mod 2, and computationally universal ones, such as in We hope 
to begin to map out this region, rising a few steps above the very simplest rules, 
and explore the grey area between linearity and computational universality. 

This may be relevant to the use of CAs in cryptography; several partially 
linear CA rules have been proposed as cryptosystems, either as random number 
generators || or as iterated encrypting functions || . Obviously these CAs would 
be much less useful for this purpose if they turn out to be efficiently predictable. 



2 Preliminaries 

We say (A,m) is a quasigroup if the left and right division properties hold: i.e., 
for all a, b there is some c such that a • c = 6, and some d such that dua — b. We 
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can also express this by saying that left and right multiplication by any element 
a is a one-to-one operation. If an r = 1/2 CA is a quasigroup, it is one-to-one 
on both its inputs; this is also referred to as left and right permutive. 

If a CA is a quasigroup, this ensures that every row has exactly n = \A\ 
pre-images, since we can choose one site in the pre-image and then use left and 
right division to fill in the rest of the sites. So quasigroup CAs are surjective 
(onto) but not reversible (since there's more than one pre-image). 

The multiplication tables of quasigroups are Latin squares, \A\ by \A squares 
where each element occurs exactly once in each row and each column 12]| . 

A left (right) identity is an element e such that e*a = a{a*e — a). An 
identity is both a left and a right identity. A quasigroup with identity is called 
a loop. In a CA, an identity will be invaded by any other state. 

A left (right) zero is an element z such that z • a = z [a • z = z), so that it 
spreads and invades all other states. Clearly no quasigroup can have a zero. 

An element a is idempotent if a • a = a. Domains of idempotents will per- 
sist, possibly invaded from left and right, so they will typically form triangular 
domains in the space-time history of the CA. 

We say • is associative if (a • o) • c = a • (b • c) for all a, b, a (A, •) is then a 
semigroup. 

If an identity exists, a left (right) inverse of a is an element a -1 such that 
a -1 »a = e (a»a _1 = e). Left and right inverses are the same if • is associative. 

If • is associative, has an identity e, and has an inverse a~ 1 for every a, then 
(A, •) is a group. Every associative quasigroup is a group, since the division 
properties allow us to find inverses and an identity. 

We say • is commutative if a • b = b • a for all a, b. Commutative groups 
are also called Abelian. We will often use the symbols + and for • and e 
respectively for an Abelian group. 

The Abelian group {0, 1, ... ,p — 1} with addition mod p is called the cyclic 
group Z p . 

If (A, •) and (B, •) are two groups (or quasigroups, or whatever), their direct 
sum A © B is the set of vectors (A x B, • ), where (a, b) • (c, d) = (a • c, b • d). 

A function h from (A, •) to (B, *) is a homomorphism if h{a*b) — h(a)*h(b). 
A homomorphism that is one-to-one and onto is an isomorphism, and we say A 
and B are isomorphic if one exists; i.e., A and B are essentially identical except 
for a relabelling of their elements. An isomorphism from A to itself is called an 
automorphism. 

Since composition o is an associative operation, the set of functions or homo- 
morphisms from A to A form a semigroup, with the identity function 1(a) = a. 
Since automorphisms have inverses, they form a group. 

Homomorphisms on a commutative group (A, +) can also be summed, by 
defining (/ + g)(a) = g(a) + f(a); so they also form a group under +. The two 
operations o and + obey a distributive property, namely f°(g + h) — f°g + f°h. 
The map 0(a) = 0, where is the identity of A, is the identity of + but a zero 
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of o. (This kind of two-operation structure is called a ring.) In the text, we will 
usually abbreviate fog as fg. 

Our model of serial computation will be a Random Access Machine (RAM). 
It has an arbitrary number of registers, each of bounded length, which it can 
access in constant time. In each computation step it can only manipulate a 
finite number of bits (so, for instance, it takes 0(n) steps to add two rt-digit 
numbers). Many of our algorithms will work just as well on a multitape Turing 



Our model of parallel computation will be families of Boolean circuits. The 
class NCj is the set of functions that can be calculated by circuits with a 
polynomial number of gates and depth log J n where n is the number of bits of 
input (n — c(t +1) for our purposes, for an alphabet with 2 C symbols). This is 
equivalent to a Parallel Random Access Machine (PRAM) that computes the 
function in time 0(\og 3 1). The union of all the NCj is NC; i.e., functions that 
can be computed in polylogarithmic parallel time by circuits of polynomial size. 

(Technical note: we are using CREW (Concurrent Read, Exclusive Write) 
PRAMs where several processors can read, but not write to, the same register 
at once. In RAMs and PRAMs are defined with each register holding an 
arbitrary integer, and with addition as an elementary operation that can be 
carried out in one step. We feel that bounding the number of bits in each 
register gives results more relevant to real computers. 

Finally, by log-' n we mean log n raised to the jth power, not log composed 
j times.) 

3 Abelian groups 

If (A, •) is an Abelian group, we'll write + and instead of • and e respectively. 
Equation (1) then becomes 

Qq Ol 02 03 



machine 



a + ai 



ai + a 2 



a>2 + a-3 



(2) 



a + 2ai + a 2 



ai + 2a 2 + a 3 



Oq + 3ai + 3a 2 + a 3 



So we can write 




or 



where G is a Green's function, 
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where a x — a and all other a, = 0. In this case G x ,t is a scalar multiplication, 
so we will simply identify it with the Pascal's Triangle coefficient and write 

G x ,t = { 

By the Chinese Remainder theorem, any Abelian group can be written as a 
direct sum A = Z^yi ffi Z P2 t X © • • • © Zp k r k where the Pi are primes — so all we 
need to know Jj], Q is G x ^ modpi ri for each i. 

But each of these coefficients can be calculated in the following way: since 

x ^^j = ^2;^ — ^ / ^ + 1)7 we start with Go.t = 1 and then use this 

recurrence to go from G x t to G x+ i_ t - We calculate G Xit as follows: write G x t = 
(p r ) m b where p r does not divide b. Then 



G mod p r 



if m ^ 

b mod p r if m = 0. 



So at each step, all we need to do is change the value of b mod p r and increment 
or decrement m. But we can similarly write t — x = (p r ) mi bi and x + 1 = 
(p r ) m2 b 2 , and then 

G x+1 . t = (p r ) m 'b' = G x /—^ = (p r ) m+mi - m2 bbi/b 2 

x + 1 

If r = 1, then m' = m + mi — m2 and 6' = bb\jbi mod p. If r > 1, we increment 
vn! if 661 /&2 has r factors of p. 

So as x ranges from to t, we have to: 

1) calculate mi, 7712,61 and 62 

2) update m and & accordingly; and 

3) add all the G x ^ t a x to get s. 

On a standard digital computer, counting all the increment and carry steps, 
step (1) takes time proportional to the m's, step (2) takes logm time to update 
m (for the addition) and constant time to update b (since arithmetic mod p r is 
a finite-state process) and step (3) takes constant time. So the total time is 



E 

x=0 



ci(mi + 771,2) + C2 logmax(m, 1711,1712) + C3 < c^t 



since the leading behavior comes from the first term and X)L=o mi ^ ^ (similarly 
for 7772). 

So, we've reproduced the result of jjj that for an 'additive' CA, for which 
(A, •) is an Abelian group, prediction can be done in time 0(t) using the prin- 
ciple of superposition — much faster than the time 0{t 2 ) full simulation takes. 
(This is also a lower bound, since any CA that depends on all its inputs takes 
linear time to read them.) 
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In m, a different method was used, in which we treat the Green's function 
as a polynomial JjJ and use the scaling properties of polynomials in a finite ring. 
We review this technique here since we will use it later on. Let 

G t (x) = G\(x) = (l+x) t 

where x represents a position operator and Gn is the coefficient of x l in Gt- 
Again, we just need to calculate G mod a finite number of primes and prime 
powers. 

If p is a prime, a classic theorem says that, for any polynomial P, 
P v (x) =p P(x p ). In particular, 

G p {x) = (1 + x) p = p l + x p = dixP) 

so that 

P p (a ,ai, . . . ,a p ) = a + a p = Pi(a ,a p ) 

In other words, p steps of the CA are equivalent to a single step on a dilated 
set of initial conditions: the CA obeys a scaling relation with a scaling ratio of 
p. For a prime power q — p m , this isn't quite true, but we still have 

G q (x) = p Gpm-i (x p ) so G p ™ (x) = p Gp-m-i (x p + ) for n > m — 1 

so 

P pn (ao,a\, . . . ,a p n) = P Pm _ 1 (x ,x pn - m+ i, . . . ,x p ») 

and the CA scales after the first p m ~ x steps. 

In either case, we can use this scaling relation to calculate the final state 
s from just a few intermediate states. For prime p, jump up by p k d\{t) where 
k = [\og p t\ is the largest power of p in t's base p expansion, and dx(t) is its 
most significant digit. 

We only need to know d\(t) + 1 sites in this row, since (1 + x) p dl = p 
(1 + x p ) dl has only d\ +1 non-zero terms. Then jump up d2(t)p k ~ 1 from there, 
and so on until we get to the top row. As is apparent from figure 2, the total 
number of sites we need to calculate is 

k i 

1 + (di + 1) + (di + l)(da + 1) + • • • = 11^ + ^ 

i=0 j = l 

which, in the worst case where t = p k+1 — 1 and dj = p — 1 for all j, is 

k 

^ i p-i 

so we get our answer in linear time. 



G 



r=2, t=7: 



7 total sites 



r=3, t=16: 



9 total sites 




Figure 2: Using scaling for addition mod a prime p. The scaling ratio is r = p, 

k k 

and (1 + x) p = 1 + x p . The total number of sites we need in the lower case 
where p — 3 and t = 16 is 9 (here d\ = 1, d 2 = 2, and d 3 = 1). 



q=4, t-15: 



26 total sites 




Figure 3: Scaling for addition mod a prime power q — p m . Here q = 4, so 
(1 + x) 2 = 1 + 2x 2 + x 2 . The total number of sites we need to calculate for 
t = 15 is 26. 
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For a prime power q = p m , first simulate t mod p m ~ x steps (which only takes 
linear time) so that the remaining time is a multiple of p m ~ x . Then we do the 
same thing, jumping up in powers of p, except that this time 

has at most p m ~ 1 di + 1 non-zero terms. In addition, many of the intermediate 
sites we need to calculate overlap, as shown in figure 3, so the worst case is 

^ 771 — 1 

Since we need to do this for a finite number of primes or prime powers, we have 
Theorem 1. An Abelian group CA can be predicted in serial time 0(t). 
Since the scaling technique gives us a tree of depth log p t, with 0{t) nodes, 

we can also say 

Theorem 2. An Abelian group CA can be predicted in parallel time O(logi) 
by 0(t) processors, so its prediction is in NCi. 

Now let's move beyond Abelian groups. When do the same prediction algo- 
rithms work? When do they break down, and what can replace them? 

4 Commutative semigroups 

If • is commutative and associative, then (A, •) is a commutative semigroup. 
Again, let's use the symbol + instead of •. If an identity doesn't already exist, 
we can add one, called 0, such that a + = + a = a. 

Just as for Abelian groups, the light-cone (1) can be simplified into a Green's 
function sum (2) and (3); except that multiplication by the coefficient G is 
actually the function G(a) = a + a + ■ ■ ■ + a . This is a homomorphism whenever 

G times 

A is a commutative semigroup. 

Now since G(a) = (G — l)(a) + a, we can start with the identity function 
1(a) = a and add it iteratively to find what action G(a) has on A. But since 
A is finite, it only has a finite number of homomorphisms; so this iteration will 
eventually enter a finite loop. So for all G greater than some j, G(a) will only 
depend on a and G mod p for some p — i.e., G(a) is eventually periodic in G. 
So except for a number of G's bounded by a constant, all we have to know is 
G mod p for some p. 

Since calculating G mod p is equivalent to finding the Green's functions for 
the CA based on Zp, the rest of the proof follows as above, and we have 

Theorem 3. A CA with a commutative semigroup operation can be predicted 
in time 0(t) in serial or 0(\ogt) in parallel, so its prediction is in NCi- 
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5 CAs separably isotopic to Abelian groups: Ad- 
ditive CAs with matrix coefficients 

Two Latin squares (n x n squares where each one of n symbols occurs exactly 
once in each row and each column) are said to be isotopic if they can be trans- 
formed into each other by permuting the rows, the columns, or the set of symbols 
& 

Recall that Latin squares can also be thought of as the multiplication tables 
of quasigroups. Then we can express isotopy between two quasigroups (A, •) 
and (A, *) as 

h(a.b)=g(a)*f(b) 

where /, g, and h are one-to-one functions on A, corresponding to permutations 
on the rows, columns, and symbols respectively. 

Clearly, an isotopy represents a similarity in the structures of • and *, but 
a much looser one than an isomorphism in which f = g = h. In this section we 
will examine quasigroups that are isotopic to an Abelian group (A, +), in the 
hope that their "group-like" structure will help us predict them efficiently 

First, if • is isotopic to *, there is always an operation o, isomorphic to •, 
which is isotopic to * with h the identity (this is called a principal isotope of *) 
[ fn^ . So it's sufficient to drop h and just write 

a*b = g{a)+f{b). 

Then, the evolution of the CA looks like 

a a 1 a 2 

ff(ao)+/(ai) g( ai )+f(a 2 ) (4) 

g(g(a ) + f( ai )) + f(g( ai ) + f(a 2 )) 

Clearly, the isotopy isn't being much help. We can only take advantage of +'s 
properties as an Abelian group if we can break terms like g(g(ao) + /(<Ji)) into 
two separate parts, one depending on ao and one depending on a\. 
For what / and g can we do that? Call a function / separable if 

f(a + b) = u(a) + v(b) 

for some u and v. 

Now since + is a group, it has an identity 0, and inverses —a for each 
a. Then if we write u'(a) = u(a) — u(0) and v'(a) — u(0) + v(a), we have 
f(a + b) = u'(a) + v'(b) with u'(0) = 0. So it's sufficient to assume that 
u(0) = 0. 

But then f(a) = /(0 + a) = u(0) + v(a) — v(a), so / = v, and f(a) 
fin ■ 0: = u(a) + /(0), so / = u + f(0). Moreover, u(a + b) = /(a + &)-/(0) = 
u(a) + f(b) — /(0) = u(a) + u(b), so u is a homomorphism. So we've proved that 
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any separable function / is actually a homomorphism u, plus a constant /(0). 
(Note that we have not used the commutativity of + to prove this, so this is 
true for groups in general.) 

Henceforth, we will restrict our attention to quasigroups that are separably 
isotopic to an Abelian group; i.e., where / and g are homomorphisms plus 
constants, so that f(a + b) = f(a) + f(b) — f(0) and g(a + b) = g(a)+g(b) — g(0). 
(Here we are using the commutativity of +.) Then the light-cone (4) becomes 

a a\ a 2 a 3 

d(ao) + f(ai) g(a 1 ) + f(a 2 ) g(a2) + f(a 3 ) 

g 2 (ao) + (gf + fg)(ai) + f 2 (a 2 ) - (/ + g)(0) ff 2 ( ftl ) + (gf + fg)(a 2 ) + f 2 (a 3 ) - (/ + g)(0) 

5 3 M + (g 2 f + gfg + /<7 2 )(<*i) + (gf 2 + fgf + f 2 g){a 2 ) + / 3 M - (/ + , 9 ) 2 (o) - 4(/ + 5 )(o) 

where the factor of 4 appears because f(a + b + c + ■ ■ •) = /(a) + /(&) + /(c) + 
• • • — fc/(0) where k is the number of +'s. 

So in addition to powers of / and g, we have a constant equal to 

t-i 



-^T(2* + z-2)(/ + 9)^(0) 



i=i 

Except for a finite number of terms this sum will just depend on (2 l +i— 2) mod p 
for some p, which is eventually periodic, and (t — i) mod q for some q, since 
(/ + gY~ l i s eventually periodic. So each term can be computed in constant 
time — and the whole sum in time linear in t. 

Now that we know how to separate out the effect of /(0) and g(0), let's 
assume for the remainder of this section that /(0) = g(0) — 0; i.e., that / 
and g are homomorphisms. We then have a Green's function as before, with 
P t = J2x=o Gx,t( a x), as we had for Abelian groups and semigroups; except now, 
G is a kind of non-commuting Pascal's Triangle 



1 

/ g 
f 2 fg + gf g 2 

f /-'// • /.<// • '//- fg 2 + g.fg + g 2 f g 3 



(5) 



where each term represents a path to that site from the initial site. 

This is a generalization of so-called additive CAs: rather than scalar coeffi- 
cients, such as a = 26 + 3c mod 5, we now have matrix coefficients (for instance, 
2x2 matrices mod 2 if the underlying group is Z|) which don't necessarily 
commute. 

Once the G x . t are calculated, we can calculate the sum P t — J2 X G x .ta x in 
serial time t or logi in parallel. However, this isn't going to help us much unless 
the G x .t are fairly easy to calculate. If each one takes t steps, for instance, then 
calculating all of them will take t 2 time, no better than complete simulation. 
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In (5), the number of terms in G x j is 



and the total number of terms 



grows as 2* — hardly what we need for an efficient algorithm. However, if / and 
g fulfill certain conditions, we'll be able to simplify (5) considerably. (Some of 
the following results will generalize to non-quasigroup isotopies, for instance if 
a • b — f(a) + g(b) where + is a semigroup, or where / and g are non-invertible 
matrices and so are not one-to-one.) 



5.1 / and g commute 

If fg = 9f, (5) becomes 



or in general, 



/ 
3fg 

Gx.t 



1 

Vg 



g 

3/5 2 



ft-x g x 



Since / and g are one-to-one, f p — 1 for some p, so f l only depends on i mod p. 
Similarly for g; so f t ~ x g x can be calculated in constant time, and the sum 
X) G Xt t(a x ) can again be calculated in time 0(t). In fact, this also works for 
CAs isotopic to an Abelian semigroup, where / and g are commuting homo- 
morphisms. 

This includes the usual "additive" CAs, where the underlying group is cyclic 
and / and g are scalars. 



5.2 f 



9' 



If p = g 2 = 1 , for even time-steps (5) becomes 



1 



1 

3h 



1 

2h 
3 + 3h 2 



1 
h 

2 + h 2 
6h + h 3 



1 

2h 
3 + 3h 2 



1 

3ft 



1 



where h = fg + gf. So even if / and g don't commute, all three functions that 
appear after two time-steps, / 2 , g 2 , and fg + gf, commute with each other; so 
we can use a r = 1 Pascal's Triangle to get 



t/2 

G x , t = 

fc=0,2,4.. 



t/2 
k 



t/2-k \ k 
(x-k)/2j n 



t/2 

E 



ii 

2 ' 



M x—k | t—x—k | 
fe=0,2,4... R - 2 • 2 • 



(6) 
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for even x. (For odd x, we sum over k = 1,3,5... instead.) The first binomial 
coefficient chooses which k steps in the path to that term in the triangle go 
straight down, gaining k factors of h, and the second chooses which of the re- 
maining steps go to the right. Their product is a trinomial coefficient I 2 _ k 

\k, — T - 

For odd t, we can easily derive G x ^ from its two predecessors with even t. 
But how can we calculate the sum (6) efficiently? 

We can use the same polynomial scaling as in section 3. If we define 
G(h, x, t) = (1 + hx + x 2 )*, we get 

G(h, x, p) = (l + hx + x 2 ) p = p 1 + h p x p + x 2p = G(h p , x p , 1) 

and 

G(h,x,q) = p G(h p ,x p ,p m ~ 1 ) 

for a prime power q — p m . So by decomposing the group into primes and prime 
powers as we did before, we can predict the CA in linear time (except since 
G(h,x, 1) has three terms, m is effectively 1 for primes and m + 1 for prime 
powers). 

In fact, we can generalize this to the following: 

Theorem 4. Consider a CA separably isotopic to a Abelian group. If there 
exists a t such that all the G x j commute, then the CA can be predicted in serial 
time 0{t) or parallel time O(logt), so its prediction is in NCi. 

5.3 f 2 and g 2 commute with / and g 

If #/ 2 — f 2 9 an( i 9 2 f — f9 2 i then we can group / 2 s to the left and g 2 s to the 
right. Every term in (5) then has the form f a (gf) b g c , as in 

1 

/ 9 

f 2 fg + gf g 2 

f 2f 2 g + fgf g.fg + 2fg 2 g 3 

Equation (6) still applies with a small modification: f 2 and g 2 don't disappear, 
but they can be moved aside, giving 

x >* 7 j 7 | x — k | t—x—k | " 

fe=0,2,4... 2 ' 2 • 

for even x and even t, where h = fg + gf as before. Then we use the same 
algorithm as above, and again take serial time 0(t) or O(logi) in parallel. 
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5.4 [/, g] commutes with / and g 



Suppose / and g don't commute with each other, but their commutator [g, f) = 
dfg 1 / 1 does. Call this ft. Then gf = hfg and we can move fs to the left 
and gs to the right: for instance, gfgf = h 3 f 2 g 2 . 
Then (5) becomes 

1 

/ 9 

f 2 (1 + /*)/<? 9 2 

f (l + h + h 2 )pg (l + h + h 2 )fg 2 g 3 

We get increasing powers of ft from the left-hand predecessors, since gf t ~ x g x = 
fot-x jt-xgx+i _ Since each path from the initial site chooses x places to turn 
right, and each one adds t — x factors of ft, we have 

x{t— x) 

G x ,t = f- X g x ]T U(k,t-x,x)h k 

k=0 

where Il(fc, to, n) is the number of ways to partition k indistinguishable objects 
into n piles of to to objects each. For instance, 11(5, 4, 3) = 4: (4, 1, 0), (3, 2, 0), 
(3, 1, 1), and (2, 2, 1). II satisfies a number of identities, including II(fc, to, n) = 
II(fc, n, to) and U(k, to, n) = Il(fc — m, m, n — 1) + Il(fc, to — 1, n). 

Since / and g are automorphisms, ft is also, so h r — 1 for some r. So in fact 
we just need to calculate 

r-l /[x(t-x)/r\ \ 

53 53 n^+ir.t-s.a;) ft fc 
fc=0 \ j=o J 

mod a finite number of primes or prime powers as before. Perhaps the reader 
can find an efficient way to calculate this series. 

5.5 Two examples: vector-valued CAs 

We now show two vector-valued CAs — where the state is a vector and the 
coefficients / and g are matrices. In particular, our states will be two-component 
vectors mod 2 (the group Zf), and our coefficients will 2x2 matrices mod 2 
(the homomorphisms on that group). 

The automorphisms (invertible matrices) of Zf, are 

(o i)'(i o)'(i i)'(i o)'(o l)'(l l) 

which the reader can check are the identity, two period-3 rotations and three 
period-2 transpositions of the non-zero elements. 
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Since we already know what to do if / and g commute, we want a non- 
commuting pair. There are only two non-isomorphic ways to do this: a pair of 
distinct transpositions, and a transposition and a rotation. In the first case, for 

instance/. f° l) g = ( I ] \ we have f = ? = 1 and we can use 



\ o ) - y V 1 , 

section 5.2 above. 

5.5.1 Logarithmic scaling 

The other case is more interesting. Let / = ^ ^ ^ ^ and <? = ^ j q ^ . Then 
/ 2 = S 3 = 1, /5 + 5/ = /> + 1 = .9 2 , and .9/ = /.g 2 : so the Green's function is 

1 

/ g 
1 / g 2 
f g 2 o i 

10 1/ 5 

f g f g 2 f g 2 

1 / 3 1 

/ 5 2 / g 2 o o / 

1000101/5 



2 



A space-time diagram is shown in figure 4. This Green's function has gaps 
similar to an Abelian group CA, but the gaps are skewed. For t — 2 fe , the 
Green's function has k + 2 non-zero terms, located 0, 1, 2 . . . 2 k sites from the 
right end of the row. 

We can show this by induction: assume that (with x counting from the right) 
the only non-zero G x j are 

( g ifz = 
G x>t ={ f if x = 1 

[ 1 if x = 2 k , k > 

as we do for t = 1. Then the only non-zero G x ^t are 

( G 2 0t ^g 2 if x = 

Gx,2t=<^ Go, t G 1 , t + G 1 , t Go,t = fg + gf = f it x = I 

[ G 0}t G X}t + G Xtt G ,t + G 2 x/2 t = 2.g + 1 = 1 if .t = 2 fe , fc > 

and similarly if Go,* = g 2 ■ 

We'll call this logarithmic scaling — it's more complex than an Abelian group 
CA, whose Green's function has only a constant number of non-zero terms when 
t = p k . It has a corresponding scaling relation 

P t (a ...a t ) = a + a t + P t/2 (a t /2 ■ • ■ a t ) 
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10000000 1000 Wfg 



Figure 4: The Green's function for the logarithmic scaling CA. Non-zero ele- 
ments are black, zero is white. When t — 2 k , the gaps leave only k + 2 non-zero 
sites. The reader may recognize this as the Green's function for elementary CA 
rule 150, rotated 120° in space-time. More precisely, if we rotate the triangle 
of predecessors and successor c = fa + gb, we get a new rule a = f'b + g'c 
where /' = f^g and g' = f' 1 ; then f' 2 = g' 2 = f'g' + g'f = 1, so after two 
time-steps we get rule 150, P(ao, oi, 02) = a + ai + a 2 on both components. 
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Figure 5: Scaling from above, coming down by decreasing powers of 2 from the 
initial row. 

rather than the simple scaling P t (a , . . . , a t ) — a + a t we encountered before. 

So if t is a power of 2, we can add up the contributions from the Green's 
function in O(logt) serial time or O (log log t) in parallel (since it takes logn 
depth to add n things). But what about for general tl 

Using "scaling from above" as shown in figure 5, we start by jumping down 
2 k steps from the initial row, where k — [\ogt\ is the largest power of 2 in t. 
Each site in this row is the sum oik + 2 initial sites; then coming down the next 
largest power of 2 from there, each site at t = 2 k + 2 k ~ 1 depends on k + 1 sites 
in the row at t = 2 fc , and so on. 

In the worst case where t = 2 k+1 — 1, we have k + 1 such rows; multiplying 
the number of sites in each row by the number of terms each site depends on in 
the row above it, we get a total time of 

, k+i 

(k + 2)2 k + (k + l)2 k - 1 + ■■■ = -H r=2 X] ri = 0(k2 k+1 ) + (D(k 2 ) =0(tIogt) 

T i=0 

so we need serial time 0(t log t). In parallel, calculating this series of rows will 
take total depth 

k 

log(fc + 2) + log(fc + 1) H = J^logi = O(fclogfc) = C(logt log log i). 

i=l 

Alternately, we can use "scaling from below", as we did for scaling CAs 
before — jumping up the largest possible power of 2 from the bottom of the 
light-cone, then the next largest power of 2 above that, and so on. In the row 
2* above the bottom, we just need to know the sites whose positions are or a 
power of 2 from the right, since that's where G's non-zero terms are. In the row 
2 l_1 above that, we need to know the sites whose positions are a sum of two or 
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0,1,2,4,8,16 




0,1,2,3,4,5,6, 

8,9,10,12, 

16,17,18,20,24 



Figure 6: Scaling from below, jumping up by decreasing powers of 2 from the 
final site. The numbers show the positions (from the right) of the sites that 
need to be calculated. 



fewer powers of 2, and so on. So, as shown in figure 6, the total number of sites 
we have to calculate in the worst case where t = 2 k — 1 is 



i + (fc+i) + ((2)+ fc + 1 ) + --- + ( 2fe - 1 ) = EE(-)=E( fc -.?)(- 



i=0 j=o x ' 7 j=0 

— I 

dr ' 



=1 (l + rf = kT- 1 = (l/2)t!ogt 



so we get the same serial computation time up to a constant. In parallel, we'll 
do the same series of additions as with scaling from above, just in reverse order 
— giving the same parallel computation time. 

The reader can confirm that the Green's function has this logarithmic scaling 
behavior whenever 

Vfc : y\ = 1 where y = f and y k+1 = y k g 2> ~ + g 2 ' ' y k (7) 

In this case, y k = f for all k since fg + gf = fg 2 + g 2 .f = /, and f 2 = 1. The 
non-quasigroup CA where f = (\ ! ] and g = ( !? !? J is another example, 



since g 2 = g, fg + gf = f, and f 2 = 1. 
So we have 

Theorem 5. An additive CA whose coefficients satisfy equation (7) can 
be predicted in time 0(tlogt) in serial or 0(logt log log t) in parallel (between 
NCi and NC 2 J. 
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5.5.2 Morse sequence Green's functions 

As another interesting example of a vector- valued CA on Zf, let / = 



1 1 
1 



and g = ^ J . Since g isn't invertible, this isn't a quasigroup. We get the 
following Green's function: 

1 

/ 9 
P h g 
11/3 
/ h j h g 
f 2 k p k f g 
1 1 f 2 h g 

f 9 f 9 1 1 / 9 
phjhhhjhg 

where f 3 = 1, g 2 = g, fg+gf = h, g+f 2 = j, and g+l = fh+gf = fh+gj = k 
where h = ( ] ) , j = ( ® ?. J , and k 1 



Whenever t = 2 k , we see that G x . t consists of / or f 2 (for odd or even k) 
followed by a Morse sequence: 

hjhh hjhj hjhh hjhh hjhh hjhj hjhh hjhj . . . 

except that the last symbol is replaced by a g. This sequence is generated by 
the dilation operations 

h -» hj, j -> hh, g -> hg 

or alternately, by the requirement that the second half is identical to the first 
half except for the last symbol. 

As with the previous example, we can show this by induction. Since g = 
h + f = j + f 2 , when t = 2 k we can write G t = /(l + £*) + M t (k even) 
or Gt = / 2 (1 + ie*) + M t (k odd), where M t 's coefficients are all hs or js; for 
instance, M4 = hx + jx 2 + hx 3 + hx 4 . Then 

„2t 



or 



G 2 t = G 2 = f(l + x 2t ) + (fM t + M t /)(1 + .t*) + M t 2 



M 2t = (fM t + M t /)(1 + .t') + M t 2 



Let m tj i be the coefficient of Xi in M t , where 1 < i < t. Since fh + hf 
fi +jf = h, we have 

m 2 t,2i-i = h and m 2t ai = h + m tz 
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which, since h 2 = 1, j 2 = 0, and h + 1 = j, gives us the dilation rules h — > /y, 
j — ► hh above. 

Again, we have a more subtle kind of scaling than before: this time, 
P t (a ...a t )= -P t / 2 (ao • • • at/2) + Pt/i{a-t/2 • ■•»*) + &* 

This allows us to predict the CA in time 0(t) in serial or O(logi) in parallel if 
t is a power of 2. What can we do for general tl 

We can use the fact that these Morse sequences are almost periodic. First 
of all, j = h + 1, so if we subtract h from every site we get 

0100 0101 0100 0100 0100 0101 0100 0101 . . . 

The population of 0s is 2/3. Using the dilations — > 01, 1 — > 00, we find that 
0s map to blocks whose second half differs from the first half by one bit, while 
Is map to blocks whose first and second halves are identical. Therefore, the 
sequence is period-1 except for 2/3 of the sites; period-2 except for 1/3 of the 
sites; period-4 except for 1/6 of the sites; and so on. 

Now if we use scaling from above, we need to calculate rows of I sites 2 k 
steps below the initial row where I < 2 k . We can calculate the first site in full, 
the second from the first by adding 2/3 of the initial sites, the third from the 
first (and the fourth from the second) by adding 1/3 of the initial sites, and so 
on: always shifting the largest possible power of 2 from a previously calculated 
site. This takes (serial) time 

2 fe (l + (2/3) + (1/3) + (1/3) + (1/6) + (1/6) + (1/6) + (1/6) + (1/12) + . . .) 

= 2 fe (l + (2/3) log/) < (2/3)fc2 fe + 2 fe 

as shown in figure 7. 

Doing this for a series of rows descending by powers of 2 from the initial 
row, we get a worst-case leading behavior of (4/3)ilogi. (In parallel, we can 
calculate all I sites in each row in time 0{k) 1 and then doing this for \ogt rows 
gives time O(log 2 t).) 

However, this doesn't quite work. When we calculate a new site in a row 
from a site 2 m to the left by shifting the Green's function, we also have to add 
contributions from 2 m new initial sites on the right (and subtract 2 m of them 
from the left). Of course, we can use the same shifting trick to calculate these 
2 m sites from previously calculated sums of 2 m sites, and so on; but even when 
we use these techniques to their best advantage, we still find the time it takes 
to calculate the 2 fe th site in a row, even if the previous 2 k — 1 sites are known, 
is proportional to 

V2fc2^ 

Each row takes at least 2 fe times this to calculate, and then summing over 
successive values of 2 k we get a leading behavior of 

i v /21ogt2\ / 2 T °s* 
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01000101010001000100010101000101 

01000101010001000100010101000101 2/3 

01000101010001000100010101000101 1/3 

01000101010001000100010101000101 1/6 

01000101010001000100010101000101 1/12 




Figure 7: The Morse sequence is almost periodic. Shifting by a power of 2 
changes an exponentially decreasing number of bits (shown here in bold). There- 
fore, sites can be calculated from others in the same row by adding contributions 
from an exponentially decreasing number of initial sites. 

This is an interesting function, bigger than t\og k t for any k, but less than t 1+c 
for any e > 0. 

However, as fun as this algorithm is to work out, a more general method is 
in fact more efficient, as we will show next. 

5.6 A general method 

Is there a method that works for all quasigroups separably isotopic to an Abelian 
group? The polynomial method we used before works perfectly well with matrix 
coefficients — just let Gi yt be the coefficient of x % in G\ where G% = (/ + gx). In 
fact, this works just as well if / and g are homomorphisms, so the CA doesn't 
need to be a quasigroup. 

Adding the sum ^2 X G x ^a x only takes linear serial time or logarithmic par- 
allel time. So how fast can we calculate (/ + gxfl 

In order to raise a polynomial G\ to the tth power, we can iterate between 
two operations: squaring it, which doubles t, and multiplying by Gi, which 
increments t by 1. The latter can be done in linear time; the former is more 
difficult, and we'll need to do it up to logi times. 

Multiplying nth-order polynomials in one variable is essentially the same as 
multiplying n-digit numbers, except that no carrying of digits takes place. The 
fastest known serial algorithms take time O(n\ogn log log n) on a multi- 
tape Turing machine, 0(n\ogn) on a RAM, and 0(n) on a storage modification 
machine, a machine which can create new registers with pointers to each other. 
(Multiplying polynomials whose coefficients are n x n matrices, for fixed n, just 
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adds a multiplicative constant to the computation time in serial and an additive 
constant in parallel.) 

Iterating all of these log t times on successively doubling values of n gives 
n oc t as the leading behavior, so: 

Theorem 6. Any CA which is additive on an Abelian group with matrix 
coefficients (including CAs which are separably isotopic to an Abelian group) can 
be predicted in serial time Oit logt log log t) on a multi-tape Turing machine, 
0(tlagt) on a RAM, or 0{t) on a storage modification machine. 

In parallel, n-digit numbers can be multiplied in time O(logn) by 0(n 2 ) 
processors, so summing over n — 2 k for k < logi we get time (1/2) log 2 n. So 

Theorem 7. Any CA which is additive on an Abelian group with matrix 
coefficients can be predicted in parallel time 0(log 2 t) by 0(t 2 ) processors, so its 
prediction is in NC2. 

6 Non-Abelian groups 

If a group is non- Abelian, we can no longer use the Green's function formalism: 
each dj's contribution is entangled with the others', so P t is no longer a linear 
function of the a*. 

Unforunately, the isotopy approach can't help us either. Since groups are 
isomorphic if they're isotopic |Q, a non- Abelian group can't be isotopic to 
an Abelian one. However, for certain non-Abelian groups we can still find an 
efficient solution. 

6.1 The Quaternions Q 8 

The Quaternion group Qs is the group {1, — 1, i, — i, j, — j, k, — k} with the fol- 
lowing relations: 

■2 -2 ; 2 -1 

1 = j = k = —1 

ij = k, jk = i.ki = j 

ji = -k,kj = -i,ik = -j 

and 1 and —1 act as we would expect ((— l)i = i( — 1) = — i, etc.) 

Note that Qs is "almost Abelian" in that oh = ±ba, or [a, b] — ±1 where 
[a, b] = aba~ 1 b~ 1 is the commutator of a and b. The commutator describes to 
what extent a and b don't commute: we can use it to re-arrange variables, as in 
ab = [a, b]ba. 

The center of a group is the subgroup of all elements that commute with 
every element, in this case {1,-1}. All commutators are in the center; so are 
all squares, since a 2 = ±1 for all a. So [a, b] = 1 if a or 6 is a commutator or a 
square. We also have a 4 — 1 and [a, b] 2 = 1, and we can derive identities like 
[a, b] = a 2 b 2 (ab) 2 and [ab,cd] = [a,c][a,d][b,c][b,d]. 
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The upshot of all this is 



a ai 




a 3 a 4 



and in general, if t 



2 for some k 



P t (a , ...,a t ) = af /2 Q [a*, a t -i] a a t . 




(8) 



We can prove this by induction: if it's true for t/2, then 

Pt = Pt/2(Pt.2( a o, ■ ■ ■ , at/2), A/2 (ai, ■ • -,04/2+1)) • ■■Pt/2(a t /2, ■ ■ -,a t )) 



which simplifies to (8). 

So except for the sign, this CA acts just like Z\ with (0, 0) = ±1, (0, 1) = ±i, 
(1,0) = ±j, and (1, 1) = ±k (since, algebraically, Q$/{±1} — Z%) with the sign 
determined by the product of commutators in (7). 

So if t is a power of 2, P t can be calculated in linear time. For other t, we 
can go up by powers of 2 from below. The sign of s is the product of the row 
of commutators times the signs of its predecessors 2 k steps above; their signs 
are their rows of commutators times the signs of their predecessors 2 fc_1 steps 
above them, and so on. So we need to calculate rows of intermediate products 
as shown in figure 8. Since commutators and squares can be removed from a 
commutator's arguments (since they commute with everything), we only need 
to know these rows up to a sign, i.e. the Z\ part of them. So we need to make 
a short diversion. 

6.1.1 Calculating a row of a scaling CA 

As we discussed in section 3, Z v with p prime scales with scaling ratio p, so for 
instance Z 2 scales with ratio 2. A single site at the bottom of a light-cone can 
be calculated in linear serial time as shown in figure 2. But now suppose we 
want to calculate a row s , Si , . . . s/_i of I sites t time-steps down from the initial 
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Figure 8: The intermediate states that need to be calculated to predict the 
Quaternion CA. The hollow circles just need to be calculated up to a sign, since 
only their squares or commutators affect the final state. 



Figure 9: Using scaling from above to calculate a row of a scaling CA. 

conditions. In parallel, we can calculate all of them at once in time O(logt); 
but to do it efficiently in serial requires more thought. 

If we calculate each one independently using this method, the serial time is 
It. But the area of the space-time trapezoid above the target row is (l + t/2)t, so 
unless I <C t this isn't any better than direct simulation. Luckily, the neighboring 
sites in the target row are not independent, so we can save time by re- using 
information. 

There are two ways to do this: "scaling from above" is shown in figure 9. 
Calculate an entire row 2 k below the initial conditions, and then take the next 
power of 2 down and so on until we reach the target row. The computation 
time is then 



where is the number of Is in i's digit sequence, and kj(t) is the position 




#i(t) 



i=i j=i 
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Figure 10: Another way of calculating a row of adjacent sites in a scaling CA: 
jump down to a t + At with a sparse Green's function (i.e., a sum of a few 
powers of 2) and then reverse the CA rule to get the row at t. 

of the jth 1, starting from the most significant. So the leading behavior is 
l#i(t) + t, which is llogt + t in the worst case (namely, if t = 2 k — 1). 

This works well if t has just a few Is in its digits, and G Xi t is sparse (zero 
for most x). But if t has a lot of Is, G Xt t will have a lot of overlap with G x +ij 
so Si and Sj+i are closely related. For instance, if t = 2 k — 1, G x j = 1 for all 
< x < t and s i+ i = s, + a i+t+ i - o^. 

This is equivalent to calculating the row immediately below this one, at 
t + 1 = 2 k whose Green's function only has two non-zero entries, and then 
reversing the CA to get Sj+i from Si and their successor. 

In general, suppose we find some At such that G is sparse for both t + At 
and At. Then we calculate the row at t + At and reverse At steps of the CA 
to get the row at t. (We have to calculate the first At of the row at t before we 
can get going; then we use reversal to get the remaining I — At of them.) This 
takes time 

At#i(t) + (Z - At)(#i(t + At) + #i(At)) + 2(t + At). 

So for a fixed At and increasing I, the leading behavior of this algorithm will 
be less than the first if #i(t + At) + #i(Ai) < #i(t), i.e. if t is the difference 
between two numbers with fewer total Is than t has. This is shown in figure 10. 

This is true whenever t has a long string of Is in its digit sequence, e.g. 
101110 = 110000-10. So for t = 101010 . . ., or roughly t = 2 fe /3, this algorithm 
can't improve matters. Since #i(t) = (l/2)logt then, both algorithms take 
serial time proportional to I \ogt + t. 

Note that we have not shown this is a lower bound — only that these two 
algorithms have this behavior in the worst case. Scaling ratios other than 2 
simply change these results by a constant. 
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Figure 11: The Steiner triple systems for n = 3, 7, and 9. 



6.1.2 Back to the Quaternions 

Using this formula, we return to figure 7. The time to calculate all the rows 
(ignoring the term linear in t because h > U) is at most 

#i(t) #i(t) 

Y, 2 i " 1 / i # 1 (^)= 2 l - 1 2' £ ^ t )(#iW-*)<2 fel # 1 (t) 2 /2<t(logt) 2 

i=l i=l 

where the inequality is a proportionality in the worst case. So the Quaternion 
CA can be predicted in serial time 0(tlog 2 t). Again, we can't rule out the 
possibility of a cleverer algorithm, but we conjecture that this is a lower bound. 

In parallel, we can calculate all the rows of commutators in O(logt) steps, 
multiply them together to get their products in another O(logt) steps, and 
multiply all those products to get the sign of the final state in another 0(\ogt) 
steps — for a total which is still 0(logt). So in parallel, the Quaternion CA 
can be predicted just as quickly as one based on an Abelian group (although 
the number of processors grows as 0(t 2 ) instead of 0(t)). 

Theorem 8. The Quaternion CA can be predicted in 0(t\og 2 t) in serial 
or O(logt) in parallel (with 0(t 2 ) processors) , so its prediction is in NCi. 



7 Steiner systems 

In addition to groups, there are other algebraic structures on which a CA could 



be based. A Steiner triple system |14 is a set Ss >n with a set of three-element 
subsets or triples such that for any two distinct elements s±, S2 there is a unique 
triple containing them. 

A simple counting argument shows that the number of elements n in a Steiner 
triple system must satisfy n mod 6 = 1 or 3, and the total number of triples is 
n(n — l)/6. There are in fact systems Sz,n °f every possible size. For n = 3, 7 
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and 9 the system is unique: 63,3 is a single triple {1,2,3}, 6*3,7 is a cyclically 
symmetric system 

{1, 2, 4}, {2, 3, 5}, {3, 4, 6}, {4, 5, 7}, {5, 6, 1}, {6, 7, 2}, {7, 1, 3} 

and 5*3,9 = 6*3,3 x 63,3, with triples of the form {(xi,yi), (2:2,2/2), (x 3 ,y s )} where 
the Xi are either all different or all the same, and similarly for the j/j. These are 
shown in figure 11. There are two non- isomorphic 63,13, one of which is cyclic 
(with 13 copies each of the triples {1, 2, 5} and {1, 3, 8}) and 80 non-isomorphic 
63,15- 



7.1 S quags 

To turn a Steiner system into an algebra, we can define a binary operation * 
on 6: let si * S2 be the third element in the unique triple containing si and S2- 
Since the triples are unordered, this operation is totally symmetric: if a * b = c, 
then c*b — a and similarly for all other permutations; and (a * b) * b = a. 

The only remaining question is what value to assign to a * a. If we choose 
to make the elements idempotent, a * a = a, we get a kind of quasigroup called 
a squag |Q . 

It's easy to see that squags are not associative, since if a * b = c then 
a* (b*c) — a * a = a but (a*b)*c = c*c = c. Our only method so far of dealing 
with non-associative quasigroups is to make them isotopic to Abelian groups. 
Under what circumstances can a squag be isotopic to a group? 

Suppose a * b = f(a) • g(b) where • is a group operation with identity e. 
Let /'(a) = /(a) • /(e)" 1 and g'(b) = /(e) • g(b); then a * b = f'(a) • g'(b) and 
/'(e) = e. So we can assume /(e) = e, and e = e*e = e» g(e) = g{e), so 
g(e) — e also. 

Now g(a) — e • g(a) = e * a, and /(a) ~ f(a) • e = a * e. But since * is 
commutative, /(a) = g{a). Then a = a * a = /(a) • /(a) = /(a) 2 ; similarly 
f(a) = /(/(a)) 2 , so a = /(/(a)) 4 . But f(f(a)) = ((a * e) * e) = a, so a = a 4 . 
So a 3 = 1, and /(a) = a 2 = a -1 . 

Now a • b = a^ 1 * b^ 1 = b^ 1 * a -1 = 6 • a, so (6, •) is in fact Abelian. Since 
a 3 = 1 for all a, (6, •) must be Z| for some fc and n = 3 fe . Prediction is easy, 
since a * 6 = (afr) -1 : Pt = P Z k t for even t and PZjj r, for odd t. 

But this only works for Steiner systems that are some power of 63,3. For 
instance, the 63,7 squag is not isotopic to any group, and it's unclear how, or 
whether, it can be predicted efficiently. 



7.2 Sloops 

Instead of making the elements idempotent, we can add an identity e and make 
them nilpotent, a* a = e. Then (6, *) is a quasigroup with an identity, or a loop] 
this particular kind is called a sloop Jl4[. 



2G 



a(bc)=(ab)c 




a c 



Figure 12: A graphical representation of associativity, on three distinct elements 
that do not form a triple (i.e., are not collinear). 

Now if a loop is isotopic to a group, it's isomorphic to it so the only 
sloops that can be isotopic to groups are those that are groups themselves. 
They're Abelian, and a* a — e, so they're Z| for some k and the Steiner system 
has n = 2 k — 1. The elements of S3, n are non-zero vectors of k components 
that sum to zero mod 2 (surprisingly, setting k = 3 gives the same 6^7 as the 
cyclic structure shown above). These are also precisely those Steiner systems 
that are associative on any three distinct elements that do not form a triple, 
shown graphically in figure 12. 

The smallest sloop that cannot be handled this way is the 10-element sloop 
formed by adding an identity to S^g. Again, it's not clear whether there's an 
efficient algorithm for it. 

However, sloops and squags act the same on a restricted set of initial con- 
ditions: namely, those in which substrings of the form aa never occur, and whose 
successors have the same property. This means Pt{ai 1 ■ ■ ■ ,&i+t) 7^ Pt{a,i+\ 1 ■ ■ ■ ,a a+t +i) 
for all t, so 

at 7^ o»+i, di 7^ a i+ 2,a,i 7^ ai + i(a,i + 2a,i + 3), 

a% ^ a l+ i{{a l+ ia l+ 2){{a l+2 a l+i ){ai + j,a l+A ))), . . . 

We conjecture that, for n > 3, these constraints leave a non-empty set of initial 
conditions on which we can handle S3. 7, S3, 9, Ss^, and others whose sloops or 
squags are isotopic to Z\ or Z\. It would be nice if deviations from this set of 
initial conditions could be handled as defects on a predictable background (such 
as in §). 
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a 




Figure 13: Multiplication in a Stein quasigroup. Each element leaves itself fixed 
(is idempotent) and rotates the other three. 

7.3 Quadruple systems and Stein quasigroups 

We can generalize the idea of a Steiner triple system in several ways. A Steiner 
quadruple system is a set S^ n with four-element subsets or quadruples, such that 
every distinct pair of elements is contained in a unique quadruple. The number 
of elements n satisfies n mod 12 = 1 or 4, and the number of quadruples is 
n(n — 1)/12. S4.4 is a single quadruple; 6*4^3 is cyclically symmetric with 13 
copies of (1,2,5,7), and 54,16 is 5*1 4- 

A Steiner quadruple system can be made into an quasigroup by assigning an 
orientation or parity to each quadruple. From a quadruple (a, b, c, d) we define 
a*b = c and b* a = d, and then use permutations of even parity to get the rest 
of the multiplication table: 
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d 



with each element idempotent and rotating the other three, just as if we rotated 
a tetrahedron around one of its vertices as shown in figure 13. This Stein 
quasigroup |]l4| satisfies the relations (a*b)*b = b*a and (b * a) * b = a. 

When is it isotopic to a group? Again, assume a * b — f(a) • g(b) where • 
has identity e and /(e) = e. Then e = e* e = e • g(e) = g(e), so g{e) — e also. 

Now / (a) = /(a) »e = a* e and g(a) — e*g{a) — e*a. But f(g(a)) = (e*a) * 
e = a, so g = / _1 and / and g commute. Moreover, f 3 (a) = ((a * e) * e) * e = a, 
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so f 3 = 1 and g = f 2 . 
But we also have 

a • a — (e * a) * (a * e) = (e * a) * ((e * a) * a) = a * (e * a) = e 

so a 2 = e for all a. So the group must be Z\ for some fc; 2 fc mod 12 = 4 for 
fc > 2 even and 8 for fc > 3 odd, so fc must be even and n is a power of 4. 

We also need to check that / is a homomorphism on •. If we replace a with 
/(a) and b with g(b), then * must have an interesting self-distributive property 
(for left or right multiplication) 

e * (a * b) = (e * a) * (e * b). 

But this follows from another fact: since • is Abelian, a • b = b • a and 
(e * a) * (b * e) = (e * b) * (a * e). But then (omitting *) 

a(ec) = (e(ae))((ce)e) = (e(ce))((ae)e) = c(ea). 

Since S 1 is isotopic to a group, every clement of S is isomorphic to every other; 
specifically* is isomorphic to another operation o with identity b instead of e, 
a o c ~ a • c • b^ 1 . So we can replace e with any element b, and get Abel- 
Grassmann's law 

a(bc) = c(ba). 

This identity is shown graphically in figure 14 (it holds automatically if a, b and 
c arc in the same quadruple). But then since (ab)(cd) = d(c(abj) = d(b(ac)) = 
(ac)(bd), we also have the medial identity 

(ab)(cd) = (ac)(bd) 

from which self-distributivity follows: 

e(ab) — (ee)(ab) = (ea)(eb). 

Self-distributivity means that multiplication by e, or any other element, is an 
automorphism on S: in particular, it sends each quadruple to another quadruple, 
either the same one or one disjoint from it. So we can choose a quadruple, and 
group all of S into its images, giving us another Steiner system (quadruples of 
quadruples) with n/4 elements. It has the same properties, so we can continue 
the process inductively until we get to a single quadruple: so S = (S^)* for 
some k. (We can use a similar argument to show that the squag above is in fact 
a power of 53,3.) But this leaves us with no algorithm for 54,13. 

In any case, we conclude this section with 

Theorem 9. If a Stein quasigroup is isotopic to a group, it is S$ 4 for some 
k and f — g^ 1 ; if a squag is isotopic to a group, it is 5f 3 for some k and f = g. 
In either case, f and g commute and so it can be predicted in 0{t) serial time 
or Oilogt) parallel time, and its prediction is in NCi. 
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Figure 14: A graphical representation of Abel-Grassmann's law a(bc) — c(ba) 
in a Stein quasigroup. 

8 Miscellaneous structures 
8.1 The Octonions 

The Octonion quasigroup Oie consists of 16 elements {±1, ±i, ±j, ±fc, ±E, ±7, ± J, ±if}, 
with the following multiplication table: 
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Just as the Quaternions are commutative up to a sign, the Octonions are as- 
sociative up to a sign. If we define the associator {a, b, c} = ±1 such that 
a(bc) = {a,b,c}(ab)c, then {a,b,c} = a 2 b 2 c 2 (ab) 2 (bc) 2 (ca) 2 (abc) 2 with the fol- 
lowing properties: 

1) It is invariant under permutations, so that {a, 6, c} = {&, c, a} = {a, c, b}. 

2) {a, a, b} = 1, so that a(ab) — a 2 b and a(ba) = (ab)a. This makes Oie an 
alternative algebra Q . 

3) The associator of a product is the product of associators, i.e. {a, 6, cd} = 
{a, b, c}{a, 6, a 7 } (this was true for commutators in Qg) 
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Figure 15: Scaling the set of associators for the Octonion CA. Each group of 
three elements corresponds to an associator. 

4) ±1 associates and commutes with everything, and commutators, associa- 
tors, and squares are all ±1. 

Just as Qs/{±1} — Z$, Oi6/{±l} = Z%; so we can treat the Octonion CA 
as addition mod 2 on three components, with a prefactor of ±1. 

For t = 2, we have 

(a ai)(aia 2 ) = {a , oi, aia 2 }a (ai(aia 2 )) = a\{a , ai, a 2 } ■ a a 2 

since {ao, a\, a\a 2 } = {ao,a\,ai}{ao,a\,a 2 } — {ao,ai,a 2 } and a\ — ±1. For 
t = 4, the prefactor consists of the Quaternion prefactor a|[ai, 0,3], times 

{a Q a 2 ,a 1 a 3 , a 2 a 4 }{a , 01, a 2 }{a 2 , a 3 , a 4 }{a , a 2 , a 4 } 

In general, the prefactor for t = 2 k+1 consists of {ao, a 2 k , a 2 (k+i)} times three 
copies of the prefactor for t = 2 k , with one copy ("A") as is, one ("B") where 
at is replaced by a i+2 k, and one ("C") where is replaced by aia i+2 k. This is 
shown graphically in figure 15. 

Note that the intermediate states we need for the associators in C are always 
products of intermediate states we needed for associators in A and B anyway. 
So each associator takes constant additional time to calculate in serial. Since 
the Quaternion part of the prefactor can be calculated as we did before, and the 
number of associators in the prefactor triples whenever we double t, we have 
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r=3 




Figure 16: If, as for the Octonions, a CA scales except for a prefactor which 
associates and commutes with all its elements, then we conjecture multiplying 
t by r should give r(r + l)/2 copies of the prefactor expression. Here r = 3. 

Theorem 10. The Octonion CA can be predicted in serial time 0(£ los3 / log2 ). 

In parallel, we can calculate the associators and intermediate states in A and 
B at the same time, and then calculate those in C in constant time; so doubling 
t only adds a constant, and 

Theorem 11. The Octonion CA can be predicted in parallel time O(logt), 
so its prediction is in NCi. 

These arguments would seem to hold whenever a CA is equivalent to a 
scaling CA except for commutators and associators which are in its center (the 
set of elements which commute and associate with everything) . So we make the 
following conjecture: 

Conjecture. If a CA 's commutators and associators are in its center, and 
if the CA mod its center scales with ratio r, then it can be predicted in serial 
time 0(t a ) where 

a = logr(r + l)/2 < 2 
logr 

and parallel time OQogt), so its prediction is in NCi. 

This value of a is based on figure 16, where we assume that r(r+ 1)/2 copies 
of the t = r k expression will be required in the t = r k+1 expression. 

(Since the associator of products is a product of associators, we could con- 
sider breaking down these expressions to a product of associators of single initial 
sites. In fact, this is how we got an algorithm for the Quaternions that runs in 
time ilog 2 i instead of t log3 / l ° e2 . Unfortunately, in the case of the Octonions 
this seems to give a number of associators that grows by a factor of 6 each time 
we double t, giving t los6 / los2 > t 2 .) 
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Figure 17: The rocks-scissors- paper CA, and the push-down automaton algo- 
rithm that predicts it. The stack represents successive right diagonals of the 
light-cone as we move through the input. 

8.2 Idempotent semigroups 

If • is associative and aa = a for all a, then we have 

a a\ a 2 a 3 

a ai aia 2 a 2 a 3 

ao a ii2 = aoCL\a2 a\a\a^ = aia 2 a 3 
a (aia 2 ) 2 a 3 = a aia 2 a 3 

Even without associativity, if the weaker property (ab)(bc) = (ab)c holds, a 

simple induction shows that Pt = n" sM a i — (' ' ' (( a o fl i)a2) ■ ■ ■ at), and simi- 
larly if (ab)(bc) = a(bc) with the products on the left. 

8.3 CAs with left- and right-moving domain walls 

Suppose, for all a and b, we have ab — a or ab = b. Then every boundary 
between a domain of as and 6s moves left or right with a velocity 1/2. We say 
a dominates b from the left or a >- b if ab = a, and so on. 

Such a system need not be associative or commutative. It is associative if -< 
and >- are transitive, and commutative if a y b implies b -< a. Figure 17 shows 
an example, the "rocks-scissors-paper" CA (see Q), r >- s y p y r, which is 
commutative but not associative: r(sp) = rs = r, but (rs)p = rp = p. 

These CAs can be predicted in linear time in the following way, using a stack 
as memory. Start with an empty stack, and add ao. Then read the ai from left 
to right: for each aj, pop all the as off the stack for which a -< a^ until we get 
to one for which a y a,;. Then add at to the stack on top of it. Finally, the 
result s is the symbol at the bottom of the stack. 
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Figure 16 shows how the algorithm works: the stack at each stage represents 
the rightmost diagonal of the light-cone bounded by do and a% ■ Since the total 
amount of pushing and popping is bounded by t, it just takes linear time. 

Since this algorithm can be carried out by a deterministic push-down au- 
tomaton (the finite state control can keep track of the bottom stack symbol) we 
have shown that, for each s, the set L s of initial sequences w — a ai . . . a t such 
that Pt(w) = s is deterministic context-free language It can be generated by 
the following grammar, from the initial symbol / s and symbols s, s', X s> , X^ s , 
and X s y^ t for all CA states s and t: 

Is — y X-^sS' X s y 

X <s — > X <t t' or t'Xty^ s if t -< s 

X sy -> X sy ^ t t' or t'X ty if s >- 1 

X s y !<t -> X sy><u u' or u'X u y !<t if s y u -< t 

Here the Xs represent places where we can add new domains, tracing backwards 
from the final site, that are annihilated (from the left, the right, or both) by 
the domains we already know about. To these productions, we add X — > e (the 
empty string) to erase the Xs at the end, and s' — > ss' or e to create multiple 
sites in each domain (with only unprimed symbols in the final product). 

This grammar is unambiguous, meaning that each sequence w has a unique 
derivation tree using these productions. Unambiguous context-free languages 
can be recognized in C(log \ w\) parallel time [|l8|; in addition, this PDA is input- 
driven, meaning that the change in its stack height depends only on the current 
input symbol, and for such PDAs this can be done with C(|w|) processors fCsf . 
Moreover, the above algorithm clearly takes linear time on a two-tape Turing 
machine. So 

Theorem 12. CAs with the property that ab — a or b, so that domain bound- 
aries move at constant speed and annihilate, can be predicted in time 0{t) in 
serial or Oilogt) in parallel (with 0{t) processors), so their prediction is in 
NCi. 

Since a two-dimensional version of the rocks-scissors-paper CA generates 
spiral waves, it's interesting to ask whether spiral waves in general, or any CA 
model of them, can be efficiently predicted or parallelized: are they, for instance, 
in NC? 

9 Scaling relations and the medial identity 

As stated briefly above, a CA scales with ratio r if P r (ao, . . . ,a r ) — ag • a r , 
so that r steps is equivalent to one step on the leftmost and rightmost initial 
conditions. As we saw in section 3, scaling allows us to predict a CA in linear 
time. 
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(ab)(cd)=(ac)(bd) 



Figure 18: The medial identity follows from scaling. To predict r + 1 time-steps, 
we can use the scaling relation on the first r or the last r, and these must yield 
the same result. 

What algebraic properties must a scaling CA have? Consider P r +\ (a,b, . . . ,c,d). 
As figure 18 shows, we can use the scaling either for the first r steps, or the last 
r. Both these methods must yield the same result, so the medial identity holds 

(a • b) • (c • d) = (a • c) • (6 • d) 

which we ran across earlier in our discussion of Steiner systems. (Mediality 
alone can't help us predict a CA, since all the products in (1) are of the form 
(ab)(bc).) 

Any medial quasigroup with identity is an Abelian group, since we immedi- 
ately get associativity and commutativity: a(bc) = (ae)(bc) = (ab)(ec) = (a6)c, 
and ab = (ea)(be) = (eb)(ae) = ba. In addition, any quasigroup separably iso- 
topic to an Abelian group, i.e. of the form a • b = f(a) + g{b) + h where / and 
g are homomorphisms and h is a constant, is medial if / and g commute. 

In fact, it can be shown that all medial quasigroups are of this form J12fl , 
with ft, = if there is an idempotent element ee — e (which becomes the identity 
of the group). So the only quasigroup CAs which have a scaling relation are 
those we already learned how to deal with in section 5.1. However, there could 
be non-quasigroup CAs which scale (i.e., medial groupoids) for which scaling 
gives the only fast algorithm. 

In any case, we might as well state what we've already proved: 

Theorem 13. Any CA with a scaling relation satisfies the medial identity. 
Furthermore, it can be predicted in 0(t) serial time or O(logt) parallel time, 
and so its prediction is in NCi. 

10 Principles of superposition 

Just as for scaling, we can ask what algebraic properties are required for a CA 
to have a principle of superposition. 
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(a.b)*(c.d)-(a*c).(b*d) 



Figure 19: The generalized medial identity (9) follows from a principle of su- 
perposition. We can apply the superposition operation * before or after the CA 
rule •. 

In general, a principle of superposition means that the state space has some 
structure on which the CA rule, seen as a function / : A 2 — ► A (or on sequences 
A n — ► A n_1 , or on bi-infmite suquences A z — > A z ) is a homomorphism. So, for 
instance, if there is a binary operation * on the alphabet A, we write /((a, b) * 
(c, <i)) = /(a, 6) * /(c, d). Writing aub for /(a, 6), we get a pleasantly symmetric 
identity 

(a • 6) * (c • d) = (a * c) • (6 * d) (9) 

as shown in figure 19 — a kind of generalized medial identity with two opera- 
tions. If • and * are the same the normal medial identity holds, as was pointed 
out in pc| |. (Incidentally, (9) also implies that * and • commute as CAs, since 
(a • b) * {b • c) = (a * b) • (b * c).) 

This identity (9) alone doesn't tell us very much. However, there are a few 
assumptions we can reasonably make. 

First of all, in order for a principle of superposition to be useful, we need 
a notion of a S-function 5 x , a : namely, an initial condition which is "zero" (an 
identity) except at site x, where it is a. Then any initial condition can be built 
up out of the &, <5i :Ql * <5 2 , Q2 * • • • * 5 t>at — (ai, a 2 , . . . , at). 

So * must have an identity, e*, so that 5 Xja can be e* everywhere but at x. 
But then 

a»b = (a * e*) • (e* * b) — (a • e*) * (e* • 6) 

Moreover, 

e* • (a * b) = (e* * e*) • (a * b) = (e • a) * (e • b) 

So the function /(a) = a»e* is a homomorphism with respect to *, and similarly 
for g(a) = e* • a. But this is just 

a • b = f(a) * g{b) 

again — in other words, • is homomorphically isotopic to *. 

Counterintuitively, * is not necessarily associative or commutative. For in- 
stance, if a • b = h(b), then (9) holds whenever ft, is a homomorphism of *. 
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at this time-slice, 
the initial conditions 
have disjoint support 



to commute a and b 



to associate a, b, and c 



Figure 20: Overlaying initial conditions with disjoint support to get commuta- 
tivity or associativity on elements whose positions in the Green's function are 
separated by spacelike intervals (the dotted lines). 

However, we can show that * commutes and associates on some elements: for 
instance, we have 

(a • e*) * (e* • b) = (a * e*) • (e* * b) = (e* * a) • (b* e*) = (e* • b) * (a • e*) 

so * commutes on pairs of the form a • , • 6. This symbolizes the fact that 
it doesn't matter what order we * together the 5 functions (a, e*) and (e*,6); 
since each one is e* where the other isn't, and e* commutes with everything. 
Similarly, * must be associative on any three elements from the t = 2 row of the 
Green's function, of the form (e*e*)(e*a), (e«b) (be*), and (ce st )(e >t e >t ). 

More generally, * is commutative on any two Ss with different x, and asso- 
ciative on any three; and since we can go down to any row where their Green's 
functions overlap, we know that * is commutative on any two elements that 
occur in the same row of the Green's functions of some two elements, and asso- 
ciative on any three. 

We can generalize this further to pairs or triplets that appear in Green's 
functions at positions with spacelike intervals between them; i.e., Ax > At or 
Ax < if At > 0, and Ax < At or Ax > if At < 0. As shown in figure 
20, we can overlay two or three initial conditions with disjoint support (each is 
e* wherever the others aren't) to put these elements in the same place; since 
* associates and commutes on these initial conditions, it must also on these 
elements. 

Since any spreading of the Green's function will lead to a spacelike interval, 
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it appears that * will be associative and commutative except on parts of the 
Green's function that don't spread out at all; for instance, with a • b = h(b), 
where information is only travelling to the left (or with an r — 1 CA we could 
construct an example with a non-commuting part travelling straight down). 
This kind of behavior doesn't add much interest, and is certainly no problem 
to predict, being just an identity map on sequences with each symbol being 
mapped by h. 

Finally, if we know * to be a quasigroup, then associativity and commutativ- 
ity make it an Abelian group; then • is homomorphically isotopic to an Abelian 
group, and the general method of theorem 6 can be used to predict it. 

A few other lemmas of some interest, assuming a principle of superposition 
(9) and an identity e*, are: 

Lemma 14. If • is a quasigroup, * is an Abelian group. 

Proof. If • is a quasigroup, then »(a, b) = a • b is one-to-one on both its 
inputs. But a • b = (a • e*) * (e* • b) and a • e* and e* • b can take any values c 
and d, so c * d must be one-to-one on both c and d. So * is a quasigroup with 
identity, or a loop; moreover, both • and * are isotopic to an Abelian group (by 
a theorem of Sade |u| following from (9)) and loops that are isotopic to groups 
are isomorphic to them [T^ |. So * is an Abelian group. 

Lemma 15. If every element a can be expressed as a product bu c, then 
is idempotent under • (i.e., e* • e* = e*). 

Proof, (e* • e*) * a = (e* • e*) * (6 • c) = (e* * b) • (e* * c) — b • c = a. So 
e* • is also an identity of *, but identities are unique. 

An element that can't be expressed as a product b • c can only appear in the 
initial row. So we can take these elements out of A, move forward one step, and 
continue with e* • e* as our new identity on the remaining subset of A. 

Lemma 16. //• has a left (right) identity e,, then is also a left (right) 
identity of •. 

Proof. Suppose e, is a left identity of •. Then a = e,»a = (e.«e*)*(e*»a) 

f , * (e* • a) = e* • a, so e* is also. Similarly for right identities. 

Corollary. If • has an identity, then e, = e* and • and * are identical. 

Proof. Since e, is both a left and a right identity of •, so is e*. So e. • e* = 
e. = so they're the same e. Then a • b — (a • e) * (e • b) — a * b. 

11 Conclusion 

We have derived the following computation times for CAs with various proper- 
ties: 
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CA property 
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Left- and right-moving domain walls 
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log* 
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in general 
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log 2 * 
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Quaternions 


t log 2 1 


log* 


Octonions 


£log 3/ log 2 


log* 


Direct simulation 


1? 


t 



We have shown that some nonlinear rules, like the Quaternion CA or CAs 
with moving domain walls, are in fact easily predictable. We have also shown 
that linearity alone does not assure linear-time predictability; CAs with complex 
Green's functions can take time up to t log t log log i on a Turing machine or 
log 2 t in parallel. In addition, in the parallel case we can make some distinctions 
based on the number of processors; for instance, 0(t) for Abelian and domain- 
wall rules, and 0(t 2 ) for vector-valued CAs and the Quaternions. Finally, we 
have explored the algebraic consequences of macroscopic properties like scaling 
and superposition. 

Is it possible that all CA rules can be predicted more efficiently than by 
direct simulation? In particular, that all CA rules are in NC, meaning their 
parallel computation time is log-* t for some j? The precise answer is only if 
P = NC, i.e. only if all problems solvable in polynomial serial time can be 
efficiently parallelized (solved in polylogarithmic parallel time). 

This is because the general problem of predicting a CA for t time-steps is 
P- complete: any polynomial serial time problem can be reduced to it. This is 
easy to see with a CA that simulates a Turing machine, such as in Ej . Consider 
a Turing machine program that runs in time n k on input of length n. Pad the 
input with blanks so that the input size is n k and the time is equal to the input 
size. If this CA is in NCj we'll get our answer in k 3 log 3 n parallel time, showing 
that the original problem is in NCj. 

Another perspective is to think of CAs as Boolean circuits, with gates at 
each site in space-time. Calculating the value of such a circuit is P-complete; 
however, calculating the value of a Boolean expression is in NCi. So all CAs 
(and all problems in P) are in NCi if and only if circuits of size n can always 
be written as expressions of size n. Although circuits of size n would a priori 
become expressions of size 2™ (since we have to repeat a sub-expression every 
time the circuit accesses the corresponding gate) just as the expressions in (1) 
grow exponentially with t, it hasn't actually been proven that this is necessary. 
Whether NC = P remains as open, and as important, an unsolved problem of 
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complexity theory as whether P = NP. 

In the meantime, we pose several open questions. 

1) We conjecture that CAs based on any finite group are in NCt for some 
k. We show in |ll| that CAs based on nilpotent and solvable groups are in 
NCi and NC2 respectively, but it is still unclear whether all finite groups are 
efficiently predictable. 

2) In analogy with the rocks-scissors-paper CA, spiral waves in two dimen- 
sions seem to be intermediate in complexity between linearity and computational 
universality. How hard are they to predict? 

3) By grouping pairs of sites together, we can turn the two-state, r = 1 
elementary CA rules of jll] into algebras with four elements. For which of these 
256 rules do these algebras have enough structure to be predicted efficiently? 
The algebraic relationship of these "block maps" and the underlying CA is 
explored in |2C[] . 

We hope through this kind of approach to develop an understanding of what 
kinds of nonlinear dynamics have enough structure to be easily understood 
and predicted, and to create a hierarchy of nonlinear systems of increasing 
complexity in the same spirit as the space and time hierarchies of algorithmic 
complexity theory [ fl6| and the Chomsky hierarchy of language and automata 
theory ||. 
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